Corpus: mwl_wikipedia_2016_30K


5.1.18 Words nearly always as next neighbors

Strong next-neighbor (NN) co-occurrences with a low probability of being separated.

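The quotient below is calculated as freq(word1)*freq(word2)/NN_freq^2. A quotient of 1.00 means that both words occur only as this next-neighbor pair; the closer the value is to 1, the more rarely the two words appear apart.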
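The quotient can be reproduced from the three frequency columns of the table. The following is a minimal sketch, not the code that generated this page: the function name `nn_quotient` and the tuple layout of `rows` are assumptions made for illustration, and the formula multiplies the frequencies of both words, which is what the tabulated values correspond to. The sample rows are copied from the table.

```python
def nn_quotient(freq_word1: int, freq_word2: int, nn_freq: int) -> float:
    """Return freq(word1) * freq(word2) / NN_freq^2 (hypothetical helper)."""
    if nn_freq <= 0:
        raise ValueError("nn_freq must be positive")
    return freq_word1 * freq_word2 / nn_freq ** 2


# (word 1, word 2, frequency of word 1, frequency of word 2, frequency as NN)
rows = [
    ("Stados", "Ounidos", 444, 350, 348),
    ("Países", "Baixos", 48, 42, 42),
    ("Pearl", "Harbor", 8, 8, 8),
]

for word1, word2, freq1, freq2, nn_freq in rows:
    # Round to two decimals, as in the table.
    print(f"{word1} {word2}: {nn_quotient(freq1, freq2, nn_freq):.2f}")
```

Because the NN frequency can never exceed the frequency of either word, the quotient is at least 1, and it equals 1 only when all three counts are identical.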

Word 1      Word 2        Frequency of word 1  Frequency of word 2  Frequency as NN  Quotient
Stados      Ounidos       444                  350                  348              1.28
Países      Baixos        48                   42                   42               1.14
Nuossa      Senhora       26                   32                   25               1.33
Buenos      Aires         18                   25                   18               1.39
per         capita        27                   23                   22               1.28
Van         Gogh          14                   16                   13               1.33
Mato        Grosso        16                   12                   12               1.33
Niagara     Falls         11                   11                   10               1.21
Treze       Quelónias     10                   10                   10               1.00
Pearl       Harbor        8                    8                    8                1.00
Daby        Jones         5                    7                    5                1.40
Eigas       Moniç         6                    7                    6                1.17
Negócios    Strangeiros   7                    6                    6                1.17
Yasser      Arafat        4                    5                    4                1.25
Te          Ching         4                    5                    4                1.25
Hong        Kong          6                    5                    5                1.20
Juscelino   Kubitschek    7                    5                    5                1.40
Nur         ad-Din        4                    5                    4                1.25
Tel         Abib          3                    4                    3                1.33
Anterno     Bruto         3                    4                    3                1.33